77 research outputs found
TransNFCM: Translation-Based Neural Fashion Compatibility Modeling
Identifying mix-and-match relationships between fashion items is an urgent
task in a fashion e-commerce recommender system. It will significantly enhance
user experience and satisfaction. However, due to the challenges of inferring
the rich yet complicated set of compatibility patterns in a large e-commerce
corpus of fashion items, this task is still underexplored. Inspired by the
recent advances in multi-relational knowledge representation learning and deep
neural networks, this paper proposes a novel Translation-based Neural Fashion
Compatibility Modeling (TransNFCM) framework, which jointly optimizes fashion
item embeddings and category-specific complementary relations in a unified
space via an end-to-end learning manner. TransNFCM places items in a unified
embedding space where a category-specific relation (category-comp-category) is
modeled as a vector translation operating on the embeddings of compatible items
from the corresponding categories. By this way, we not only capture the
specific notion of compatibility conditioned on a specific pair of
complementary categories, but also preserve the global notion of compatibility.
We also design a deep fashion item encoder which exploits the complementary
characteristic of visual and textual features to represent the fashion
products. To the best of our knowledge, this is the first work that uses
category-specific complementary relations to model the category-aware
compatibility between items in a translation-based embedding space. Extensive
experiments demonstrate the effectiveness of TransNFCM over the
state-of-the-arts on two real-world datasets.Comment: Accepted in AAAI 2019 conferenc
Diffusion Variational Autoencoder for Tackling Stochasticity in Multi-Step Regression Stock Price Prediction
Multi-step stock price prediction over a long-term horizon is crucial for
forecasting its volatility, allowing financial institutions to price and hedge
derivatives, and banks to quantify the risk in their trading books.
Additionally, most financial regulators also require a liquidity horizon of
several days for institutional investors to exit their risky assets, in order
to not materially affect market prices. However, the task of multi-step stock
price prediction is challenging, given the highly stochastic nature of stock
data. Current solutions to tackle this problem are mostly designed for
single-step, classification-based predictions, and are limited to low
representation expressiveness. The problem also gets progressively harder with
the introduction of the target price sequence, which also contains stochastic
noise and reduces generalizability at test-time. To tackle these issues, we
combine a deep hierarchical variational-autoencoder (VAE) and diffusion
probabilistic techniques to do seq2seq stock prediction through a stochastic
generative process. The hierarchical VAE allows us to learn the complex and
low-level latent variables for stock prediction, while the diffusion
probabilistic model trains the predictor to handle stock price stochasticity by
progressively adding random noise to the stock data. Our Diffusion-VAE (D-Va)
model is shown to outperform state-of-the-art solutions in terms of its
prediction accuracy and variance. More importantly, the multi-step outputs can
also allow us to form a stock portfolio over the prediction length. We
demonstrate the effectiveness of our model outputs in the portfolio investment
task through the Sharpe ratio metric and highlight the importance of dealing
with different types of prediction uncertainties.Comment: CIKM 202
Causal Disentangled Recommendation Against User Preference Shifts
Recommender systems easily face the issue of user preference shifts. User
representations will become out-of-date and lead to inappropriate
recommendations if user preference has shifted over time. To solve the issue,
existing work focuses on learning robust representations or predicting the
shifting pattern. There lacks a comprehensive view to discover the underlying
reasons for user preference shifts. To understand the preference shift, we
abstract a causal graph to describe the generation procedure of user
interaction sequences. Assuming user preference is stable within a short
period, we abstract the interaction sequence as a set of chronological
environments. From the causal graph, we find that the changes of some
unobserved factors (e.g., becoming pregnant) cause preference shifts between
environments. Besides, the fine-grained user preference over categories
sparsely affects the interactions with different items. Inspired by the causal
graph, our key considerations to handle preference shifts lie in modeling the
interaction generation procedure by: 1) capturing the preference shifts across
environments for accurate preference prediction, and 2) disentangling the
sparse influence from user preference to interactions for accurate effect
estimation of preference. To this end, we propose a Causal Disentangled
Recommendation (CDR) framework, which captures preference shifts via a temporal
variational autoencoder and learns the sparse influence from multiple
environments. Specifically, an encoder is adopted to infer the unobserved
factors from user interactions while a decoder is to model the interaction
generation process. Besides, we introduce two learnable matrices to disentangle
the sparse influence from user preference to interactions. Lastly, we devise a
multi-objective loss to optimize CDR. Extensive experiments on three datasets
show the superiority of CDR.Comment: This paper has been accepted for publication in Transactions on
Information System
Context-aware Event Forecasting via Graph Disentanglement
Event forecasting has been a demanding and challenging task throughout the
entire human history. It plays a pivotal role in crisis alarming and disaster
prevention in various aspects of the whole society. The task of event
forecasting aims to model the relational and temporal patterns based on
historical events and makes forecasting to what will happen in the future. Most
existing studies on event forecasting formulate it as a problem of link
prediction on temporal event graphs. However, such pure structured formulation
suffers from two main limitations: 1) most events fall into general and
high-level types in the event ontology, and therefore they tend to be
coarse-grained and offers little utility which inevitably harms the forecasting
accuracy; and 2) the events defined by a fixed ontology are unable to retain
the out-of-ontology contextual information. To address these limitations, we
propose a novel task of context-aware event forecasting which incorporates
auxiliary contextual information. First, the categorical context provides
supplementary fine-grained information to the coarse-grained events. Second and
more importantly, the context provides additional information towards specific
situation and condition, which is crucial or even determinant to what will
happen next. However, it is challenging to properly integrate context into the
event forecasting framework, considering the complex patterns in the
multi-context scenario. Towards this end, we design a novel framework named
Separation and Collaboration Graph Disentanglement (short as SeCoGD) for
context-aware event forecasting. Since there is no available dataset for this
novel task, we construct three large-scale datasets based on GDELT.
Experimental results demonstrate that our model outperforms a list of SOTA
methods.Comment: KDD 2023, 9 pages, 7 figures, 4 table
Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction
Automatic bundle construction is a crucial prerequisite step in various
bundle-aware online services. Previous approaches are mostly designed to model
the bundling strategy of existing bundles. However, it is hard to acquire
large-scale well-curated bundle dataset, especially for those platforms that
have not offered bundle services before. Even for platforms with mature bundle
services, there are still many items that are included in few or even zero
bundles, which give rise to sparsity and cold-start challenges in the bundle
construction models. To tackle these issues, we target at leveraging multimodal
features, item-level user feedback signals, and the bundle composition
information, to achieve a comprehensive formulation of bundle construction.
Nevertheless, such formulation poses two new technical challenges: 1) how to
learn effective representations by optimally unifying multiple features, and 2)
how to address the problems of modality missing, noise, and sparsity problems
induced by the incomplete query bundles. In this work, to address these
technical challenges, we propose a Contrastive Learning-enhanced Hierarchical
Encoder method (CLHE). Specifically, we use self-attention modules to combine
the multimodal and multi-item features, and then leverage both item- and
bundle-level contrastive learning to enhance the representation learning, thus
to counter the modality missing, noise, and sparsity problems. Extensive
experiments on four datasets in two application domains demonstrate that our
method outperforms a list of SOTA methods. The code and dataset are available
at https://github.com/Xiaohao-Liu/CLHE
Current-oscillator correlation and Fano factor spectrum of quantum shuttle with finite bias voltage and temperature
A general master equation is derived to describe an electromechanical
single-dot transistor in the Coulomb blockade regime. In the equation, Fermi
distribution functions in the two leads are taken into account, which allows
one to study the system as a function of bias voltage and temperature of the
leads. Furthermore, we treat the coherent interaction mechanism between
electron tunneling events and the dynamics of excited vibrational modes.
Stationary solutions of the equation are numerically calculated. We show
current through the oscillating island at low temperature appears step like
characteristics as a function of the bias voltage and the steps depend on mean
phonon number of the oscillator. At higher temperatures the current steps would
disappear and this event is accompanied by the emergence of thermal noise of
the charge transfer. When the system is mainly in the ground state, zero
frequency Fano factor of current manifests sub-Poissonian noise and when the
system is partially driven into its excited states it exhibits super-Poissonian
noise. The difference in the current noise would almost be removed for the
situation in which the dissipation rate of the oscillator is much larger than
the bare tunneling rates of electrons.Comment: 14 pages, 8 figure
- …